♦ 

AATATTTTCCTTGACCTAATGCCATCTTGTGTCCCCTTGCAGAGCCCTATTCCTAACATGGCTGATGACTA 
TGGCTCTGAATCCACATCTTCCATGGAAGACTACGTTAACTTCAACTTCACTGACTTCTACTGTGAGAAAA 
ACAATGTCAGGCAGTTTGCGAGCCATTTCCTCCCACCCTTGTACTGGCTCGTGTTCATCGTGGGTGCCTTG 
GGCAACAGTCTTGTTATCCTTGTCTACTGGTACTGCACAAGAGTGAAGACCATGACCGACATGTTCCTTTT 
GAATTTGGCAATTGCTGACCTCCTCTTTCTTGTCACTCTTCCCTTCTGGGCCATTGCTGCTGCTGACCAGT 
GGAAGTTCCAGACCTTCATGTGCAAGGTGGTCAACAGCATGTACAAGATGAACTTCTACAGCTGTGTGTTG 
CTGATCATGTGCATCAGCGTGGACAGGTACATTGCCATTGCCCAGGCCATGAGAGCACATACTTGGAGGGA 
GAAAAGGCTTTTGTACAGCAAAATGGTTTGCTTTACCATCTGGGTATTGGCAGCTGCTCTCTGCATCCCAG 
AAATCTTATACAGCCAAATCAAGGAGGAATCCGGCATTGCTATCTGCACCATGGTTTACCCTAGCGATGAG 
AGCACCAAACTGAAGTCAGCTGTCTTGACCCTGAAGGTCATTCTGGGGTTCTTCCTTCCCTTCGTGGTCAT 
GGCTTGCTGCTATACCATCATCATTCACACCCTGATACAAGCCAAGAAGTCTTCCAAGCACAAAGCCCTAA 
AAGTGACCATCACTGTCCTGACCGTCTTTGTCTTGTCTCAGTTTCCCTACAACTGCATTTTGTTGGTGCAG 
ACCATTGACGCCTATGCCATGTTCATCTCCAACTGTGCCGTTTCCACCAACATTGACATCTGCTTCCAGGT 
CACCCAGACCATCGCCTTCTTCCACAGTTGCCTGAACCCTGTTCTCTATGTTTTTGTGGGTGAGAGATTCC 
GCCGGGATCTCGTGAAAACCCTGAAGAACTTGGGTTGCATCAGCCAGGCCCAGTGGGTTTCATTTACAAGG 
AGAGAGGGAAGCTTGAAGCTGTCGTCTATGTTGCTGGAGACAACCTCAGGAGCACTCTCCCTCTGAGGGGT 
CTTCTCTGAGGTGCATGGTTCTTTTGGAAGAAATGAGAAATACATGAAACAGTTTCCCCACTGATGGGACC 
AGAGAGAGTGAAAGAGAAAAGAAAACTCAGAAAGGGATGAATCTGAACTATATGATTACTTGTAGTCAGAA 
TTTGCCAAAGCAAATATTTCAAAATCAACTGACTAGTGCAGGAGGCTGTTGATTGGCTCTTGACTGTGATG 
CCCGCAATTCTCAAAGGAGGACTAAGGACCGGCACTGTGGAGCACCCTGGCTTTGCCACTCGCCGGAGCAT 
CAATGCCGCTGCCTCTGGAGGAGCCCTTGGATTTTCTCCATGCACTGTGAACTTCTGTGGCTTCAGTTCTC 
ATGCTGCCTCTTCCAAAAGGGGACACAGAAGCACTGGCTGCTGCTACAGACCGCAAAAGCAGAAAGTTTCG 
TGAAAATGTCCATCTTTGGGAAATTTTCTACCCTGCTCTTGAGCCTGATAACCCATGCCAGGTCTTATAGA 
TTCCTGATCTAGAACCTTTCCAGGCAATCTCAGACCTAATTTCCTTCTGTTCTCCTTGTTCTGTTCTGGGC 
CAGTGAAGGTCCTTGTTCTGATTTTGAAACGATCTGCAGGTCTTGCCAGTGAACCCCTGGACAACTGACCA 
CACCCACAAGGCATCCAAAGTCTGTTGGCTTCCAATCCATTTCTGTGTCCTGCTGGAGGTTTTAACCTAGA 
CAAGGATTCCGCTTATTCCTTGGTATGGTGACAGTGTCTCTCCATGGCCTGAGCAGGGAGATTATAACAGC 
TGGGTTCGCAGGAGCCAGCCTTGGCCCTGTTGTAGGCTTGTTCTGTTGAGTGGCACTTGCTTTGGGTCCAC 
CGTCTGTCTGCTCCCTAGAAAATGGGCTGGTTCTTTTGGCCCTCTTCTTTCTGAGGCCCACTTTATTCTGA 
GGAATACAGTGAGCAGATATGGGCAGCAGCCAGGTAGGGCAAAGGGGTGAAGCGCAGGCCTTGCTGGAAGG 
CTATTTACTTCCATGCTTCTCCTTTTCTTACTCTATAGTGGCAACATTTTAAAAGCTTTTAACTTAGAGAT 
TAGGCTGAAAAAAATAAGTAATGGAATTCACCTTTGCATCTTTTGTGTCTTTCTTATCATGATTTGGCAAA 
ATGCATCACCTTTGAAAATATTTCACATATTGGAAAAGTGCTTTTTAATGTGTATATGAAGCATTAATTAC 
TTGTCACTTTCTTTACCCTGTCTCAATATTTTAAGTGTGTGCAATTAAAGATCAAATAGATACATTAAGAG 
TGTGAAGGCTGGTCTGAAGGTAGTGAGCTATCTCAATCGGATTGTTCACACTCAGTTACAGATTGAACTCC 
TTGTTCTACTTCCCTGCTTCTCTCTACTGCAATTGACTAGTCTTTAAAAAAAAGTGTGAAGAGTAAGCAAT 
AGGG AT AAGGAAAT AAGATC T (SEQ ID NO: 1) 



MADDYGSESTSSMEDYVNFNFTDFYCEKNNVRQFASHFLPPLYWVFIVGALGNSLVILVYWYCTRVKTMTD 
MFLLNLAIADLLFLVTLPFWAIAAADQWKFQTFMCKWNSMYKMNFYSCVLLIMCISVDRYIAIAQAMRAH 
TWREKRLLYSKMVCFTIWVLAAALCIPEILYSQIKEESGIAICTMVYPSDESTKLKSAVLTLKVILGFFLP 
FWMACCYTIIIHTLIQAKKSSKHKALKVTITVLTVFVLSQFPYNCILLVQTIDAYAMFISNCAVSTNIDI 
CFQVTQTIAFFHSCLNPVLYVFVGERFRRDLVKTLKNLGCISQAQWVSFTRREGSLKLSSMLLETTSGALS 
L (SEQIDNO.2) 



FIGURE 1 



Underlined = deleted in targeting construct 

Bold = sequence flanking Neo insert in targeting construct 



aatattttccttgacctaat6ccatcttgtgtccccttgcagagccctattcctaacatg 

gctgatgactatggctctgaatccacatcttccatggaagactacgttaacttcaacttc 

actgacttctactgtgagaaaaacaatgtcaggcagtttgcgagccatttcctcccaccc 

ttgtactggctcgtgttcatcgtgggtgccttgggcaacagtcttgttat ccttgtctac 

tggtactgcacaagagtgaagaccatgaccgacatgttccttttgaatttggcaattgct 

gacctcctctttcttgtcactcttcccttctgggccattgctgctgctgaccagtggaag 

ttccagaccttcatgtgcaaggtggtcaacagca tgtacaagatgaacttctacagctgt 

gtgttgctgatcatgtgcatcagcgtggacaggtacattgccattgcccaggccatgaga 

gcacatacttggagggagaaaaggcttttgtacagcaaaatggtttgctttaccatctgg 

gtattggcagctgctctctgcatcccagaaatcttatacagccaaatcaaggaggaatcc 

ggcattgctatctgcaccatggtttaccctagcgatgagagcaccaaactgaagtcagct 

gtcttgaccctgaaggtcattctggggttcttccttcccttcgtggtcatggcttgctgc 

tataccatcatcattcacaccctgatacaagccaagaagtcttccaagcacaaagcccta 

aaagtgaccatcactgtcctgaccgtctttgtcttgtctcagtttccctacaactgcatt 

t tgttggtgcagaccattgacgcctatgccatgttcatctccaactgtgccgtttccacc 

aacattgacatctgcttccaggtcacccagaccatcgccttcttccacagttgcctgaac 

cctgttctctatgtttttgtgggtgagagattccgccgggatctcgtgaaaaccctgaag 

aacttgggttgcatcagccaggcccagtgggtttcatttacaaggagagagggaagcttg 

aagctgtcgtctatgttgctggagacaacctcaggagcactctccctctgaggggtcttc 

tctgaggtgcatggttcttttggaagaaatgagaaatacatgaaacagtttccccactga 

tgggaccagagagagtgaaagagaaaagaaaactcagaaagggatgaatctgaactatat 

gattacttgtagtcagaatttgccaaagcaaatatttcaaaatcaactgactagtgcagg 

aggctgttgattggctcttgactgtgatgcccgcaattctcaaaggaggactaaggaccg 

gcactgtggagcaccctggctttgccactcgccggagcatcaatgccgctgcctctggag 

gagcccttggattttctccatgcactgtgaacttctgtggcttcagttctcatgctgcct 

cttccaaaaggggacacagaagcactggctgctgctacagaccgcaaaagcagaaagttt 

cgtgaaaatgtccatctttgggaaattttctaccctgctcttgagcctgataacccatgc 

caggtcttatagattcctgatctagaacctttccaggcaatctcagacctaatttccttc 

tgttctccttgttctgttctgggccagtgaaggtccttgttctgattttgaaacgatctg 

caggtcttgccagtgaacccctggacaactgaccacacccacaaggcatccaaagtctgt 

tggcttccaatccatttctgtgtcctgctggaggttttaacctagacaaggattccgctt 

attccttggtatggtgacagtgtctctccatggcctgagcagggagattataacagctgg 

gttcgcaggagccagccttggccctgttgtaggcttgttctgttgagtggcacttgcttt 

gggtccaccgtctgtctgctccctagaaaatgggctggttcttttggccctcttctttct 

gaggcccactttattctgaggaatacagtgagcagatatgggcagcagccaggtagggca 

aaggggtgaagcgcaggccttgctggaaggctatttacttccatgcttctccttttctta 

ctctatagtggcaacattttaaaagcttttaacttagagattaggctgaaaaaaataagt 

aatggaattc ac ctttgc atc ttttgtgtc tttc ttatc atgatttggc aaaatgc atc a 

cctttgaaaatatttcacatattggaaaagtgctttttaatgtgtatatgaagcattaat 

tacttgtcactttctttaccctgtctcaatattttaagtgtgtgcaattaaagatcaaat 

agatacattaagagtgtgaaggctggtctgaaggtagtgagctatctcaatcggattgtt 

cacactcagttacagattgaactccttgttctacttccctgcttctctctactgcaattg 

actagtctttaaaaaaaagtgtgaagagtaagcaatagggataaggaaataagatct 



FIGURE 2A 



Gene Sequence 
Structure * 

Size of CDS: 2577 bp 



231 bp 



Sequence Deleted 



394 bp 



Targeting Vector* (genomic sequence) 
Construct Number: 3340 



Arm Length: 
5': 2.7 kb 
3": 4 kb 



Targeting Vector 

----- Endogenous Locus 

* Not drawn to scale 




5' arm 



LacZ-Neo 
Cassette 



3* arm 




5 ' > T ATTC C TT AC AG AGC C T T ATT 
C CTGGC ATGTTTGATGAC TTC AGC 
TATGACTCCACTGCTTCCACAGAT 
GACTACATGAATTTGAATTTCAGT 
AGCTTCTTCTGTAAGAAAAATAAT 
GTC AGGC AGTTTGC AAGC C ATTTT 
CTCCCACCTCTGTACTGGCTTGTG 
TTCATTGTGGGCACCTTGGGCAAC 
AGC C TGGTC AT < 3 ' 
(SEQ.ID.N0.3) 



5 1 >TGTACAAGATGAACTTCTACA 
GC TGTGTGC TTCTC ATC ATGTGC A 
TCAGTGTGGACAGATACATTGCCA 
TTGTACAGGCCATGAAGGCTCAGG 
TCTGGAGGCAGAAAAGGCCGCTAT 
ACAGCAAGATGGTCTGCATTACCA 
TCTGGGTGATGGCAGCTGTGCTCT 
GCACCCCAGAAATCCTGTACAGTC 
AAGTC AGTGGG< 3 ■ 
(SEQ. ID. NO. 4) 
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